Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 14266 |
| Missing cells | 2705 |
| Missing cells (%) | 0.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.2 MiB |
| Average record size in memory | 232.0 B |
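The derived figures in this table follow from the raw counts. As a minimal sketch in plain Python (the variable names are illustrative and not part of the report), the missing-cell percentage is the number of missing cells divided by the total cell count, and the total size in memory is approximately the average record size times the number of observations:

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 29
n_observations = 14266
missing_cells = 2705
avg_record_bytes = 232.0

# Total number of cells in the dataset (rows x columns).
total_cells = n_variables * n_observations

# Share of cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

# Approximate total size, converted from bytes to MiB (1 MiB = 1024**2 bytes).
total_mib = avg_record_bytes * n_observations / 1024**2

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"approx. size in memory: {total_mib:.1f} MiB")
```

With the rounded inputs shown in the table, this reproduces the reported 0.7% and 3.2 MiB.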
Variable types
| Categorical | 13 |
|---|---|
| Numeric | 16 |
C OF O STATUS has constant value "CO Issued" | Constant |
HOUSE NO has a high cardinality: 3741 distinct values | High cardinality |
STREET NAME has a high cardinality: 3301 distinct values | High cardinality |
SUBMITTED DATE has a high cardinality: 399 distinct values | High cardinality |
C OF O ISSUANCE DATE has a high cardinality: 14258 distinct values | High cardinality |
APPLICATION NUMBER has a high cardinality: 14265 distinct values | High cardinality |
nta has a high cardinality: 193 distinct values | High cardinality |
ntaName has a high cardinality: 193 distinct values | High cardinality |
BIN is highly correlated with ZIP CODE and 6 other fields | High correlation |
BLOCK is highly correlated with ZIP CODE and 7 other fields | High correlation |
ZIP CODE is highly correlated with BIN and 9 other fields | High correlation |
COMMUNITY BOARD is highly correlated with BIN and 7 other fields | High correlation |
xCoordinate is highly correlated with BLOCK and 2 other fields | High correlation |
yCoordinate is highly correlated with latitude and 1 other fields | High correlation |
latitude is highly correlated with yCoordinate and 1 other fields | High correlation |
longitude is highly correlated with BLOCK and 2 other fields | High correlation |
communityDistrict is highly correlated with BIN and 7 other fields | High correlation |
communityDistrictBoroughCode is highly correlated with BIN and 6 other fields | High correlation |
communityDistrictNumber is highly correlated with BLOCK | High correlation |
cityCouncilDistrict is highly correlated with BIN and 8 other fields | High correlation |
censusTract2010 is highly correlated with BLOCK and 1 other fields | High correlation |
buildingIdentificationNumber is highly correlated with BIN and 6 other fields | High correlation |
bbl is highly correlated with BIN and 8 other fields | High correlation |
BIN is highly correlated with ZIP CODE and 8 other fields | High correlation |
BLOCK is highly correlated with communityDistrictNumber | High correlation |
ZIP CODE is highly correlated with BIN and 8 other fields | High correlation |
COMMUNITY BOARD is highly correlated with BIN and 8 other fields | High correlation |
xCoordinate is highly correlated with ZIP CODE and 1 other fields | High correlation |
yCoordinate is highly correlated with BIN and 7 other fields | High correlation |
latitude is highly correlated with BIN and 7 other fields | High correlation |
longitude is highly correlated with ZIP CODE and 1 other fields | High correlation |
communityDistrict is highly correlated with BIN and 8 other fields | High correlation |
communityDistrictBoroughCode is highly correlated with BIN and 8 other fields | High correlation |
communityDistrictNumber is highly correlated with BLOCK | High correlation |
cityCouncilDistrict is highly correlated with BIN and 8 other fields | High correlation |
buildingIdentificationNumber is highly correlated with BIN and 8 other fields | High correlation |
bbl is highly correlated with BIN and 8 other fields | High correlation |
BIN is highly correlated with ZIP CODE and 6 other fields | High correlation |
BLOCK is highly correlated with censusTract2010 and 1 other fields | High correlation |
ZIP CODE is highly correlated with BIN and 5 other fields | High correlation |
COMMUNITY BOARD is highly correlated with BIN and 6 other fields | High correlation |
xCoordinate is highly correlated with longitude | High correlation |
yCoordinate is highly correlated with latitude and 1 other fields | High correlation |
latitude is highly correlated with yCoordinate and 1 other fields | High correlation |
longitude is highly correlated with xCoordinate | High correlation |
communityDistrict is highly correlated with BIN and 6 other fields | High correlation |
communityDistrictBoroughCode is highly correlated with BIN and 6 other fields | High correlation |
cityCouncilDistrict is highly correlated with BIN and 6 other fields | High correlation |
censusTract2010 is highly correlated with BLOCK | High correlation |
buildingIdentificationNumber is highly correlated with BIN and 5 other fields | High correlation |
bbl is highly correlated with BIN and 7 other fields | High correlation |
C OF O STATUS is highly correlated with communityDistrictBoroughCode and 4 other fields | High correlation |
communityDistrictBoroughCode is highly correlated with C OF O STATUS and 1 other fields | High correlation |
BOROUGH is highly correlated with C OF O STATUS and 1 other fields | High correlation |
JOB TYPE is highly correlated with C OF O STATUS and 1 other fields | High correlation |
JOB FILING NAME is highly correlated with C OF O STATUS and 1 other fields | High correlation |
C OF O FILING TYPE is highly correlated with C OF O STATUS | High correlation |
JOB FILING NAME is highly correlated with JOB TYPE | High correlation |
JOB TYPE is highly correlated with JOB FILING NAME and 1 other fields | High correlation |
BIN is highly correlated with JOB TYPE and 13 other fields | High correlation |
BOROUGH is highly correlated with BIN and 12 other fields | High correlation |
BLOCK is highly correlated with ZIP CODE and 4 other fields | High correlation |
ZIP CODE is highly correlated with BIN and 14 other fields | High correlation |
C OF O FILING TYPE is highly correlated with ZIP CODE | High correlation |
COMMUNITY BOARD is highly correlated with BIN and 14 other fields | High correlation |
xCoordinate is highly correlated with BIN and 14 other fields | High correlation |
yCoordinate is highly correlated with BIN and 13 other fields | High correlation |
latitude is highly correlated with BIN and 13 other fields | High correlation |
longitude is highly correlated with BIN and 14 other fields | High correlation |
communityDistrict is highly correlated with BIN and 12 other fields | High correlation |
communityDistrictBoroughCode is highly correlated with BIN and 12 other fields | High correlation |
communityDistrictNumber is highly correlated with COMMUNITY BOARD and 7 other fields | High correlation |
cityCouncilDistrict is highly correlated with BIN and 13 other fields | High correlation |
censusTract2010 is highly correlated with BIN and 10 other fields | High correlation |
buildingIdentificationNumber is highly correlated with BIN and 11 other fields | High correlation |
bbl is highly correlated with BIN and 14 other fields | High correlation |
buildingIdentificationNumber has 672 (4.7%) missing values | Missing |
bbl has 672 (4.7%) missing values | Missing |
C OF O ISSUANCE DATE is uniformly distributed | Uniform |
APPLICATION NUMBER is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2022-06-30 20:49:57.606245 |
|---|---|
| Analysis finished | 2022-06-30 20:50:48.872980 |
| Duration | 51.27 seconds |
| Software version | pandas-profiling v3.2.0 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| 01 | |
|---|---|
| I1 | 155 |
| 02 | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 28532 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 01 |
|---|---|
| 2nd row | 01 |
| 3rd row | 01 |
| 4th row | 01 |
| 5th row | 01 |
Common Values
| Value | Count | Frequency (%) |
| 01 | 14110 | |
| I1 | 155 | 1.1% |
| 02 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 01 | 14110 | |
| i1 | 155 | 1.1% |
| 02 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 14265 | |
| 0 | 14111 | |
| I | 155 | 0.5% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28377 | |
| Uppercase Letter | 155 | 0.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 14265 | |
| 0 | 14111 | |
| 2 | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 155 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28377 | |
| Latin | 155 | 0.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 14265 | |
| 0 | 14111 | |
| 2 | 1 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| I | 155 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28532 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 14265 | |
| 0 | 14111 | |
| I | 155 | 0.5% |
| 2 | 1 | < 0.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| ALTERATION TYPE 1 | |
|---|---|
| NEW BUILDING | |
| Alteration CO | 111 |
| New Building | 32 |
| CO - New Building with Existing Elements to Remain | 12 |
Length
| Max length | 50 |
|---|---|
| Median length | 17 |
| Mean length | 14.78858825 |
| Min length | 12 |
Characters and Unicode
| Total characters | 210974 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ALTERATION TYPE 1 |
|---|---|
| 2nd row | ALTERATION TYPE 1 |
| 3rd row | ALTERATION TYPE 1 |
| 4th row | ALTERATION TYPE 1 |
| 5th row | ALTERATION TYPE 1 |
Common Values
| Value | Count | Frequency (%) |
| ALTERATION TYPE 1 | 7843 | |
| NEW BUILDING | 6268 | |
| Alteration CO | 111 | 0.8% |
| New Building | 32 | 0.2% |
| CO - New Building with Existing Elements to Remain | 12 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| alteration | 7954 | |
| type | 7843 | |
| 1 | 7843 | |
| new | 6312 | |
| building | 6312 | |
| co | 123 | 0.3% |
| - | 12 | < 0.1% |
| with | 12 | < 0.1% |
| existing | 12 | < 0.1% |
| elements | 12 | < 0.1% |
| Other values (2) | 24 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 23529 | |
| (space) | 22193 | |
| E | 21978 | |
| N | 20423 | |
| I | 20379 | |
| A | 15797 | 7.5% |
| L | 14111 | 6.7% |
| O | 7966 | 3.8% |
| R | 7855 | 3.7% |
| 1 | 7843 | 3.7% |
| Other values (25) | 48900 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 179231 | |
| Space Separator | 22193 | 10.5% |
| Decimal Number | 7843 | 3.7% |
| Lowercase Letter | 1695 | 0.8% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 23529 | |
| E | 21978 | |
| N | 20423 | |
| I | 20379 | |
| A | 15797 | |
| L | 14111 | |
| O | 7966 | 4.4% |
| R | 7855 | 4.4% |
| P | 7843 | 4.4% |
| Y | 7843 | 4.4% |
| Other values (6) | 31507 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 270 | |
| i | 247 | |
| e | 191 | |
| n | 191 | |
| l | 167 | |
| a | 123 | |
| o | 123 | |
| r | 111 | |
| w | 56 | 3.3% |
| g | 56 | 3.3% |
| Other values (6) | 160 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 22193 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7843 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 180926 | |
| Common | 30048 | 14.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 23529 | |
| E | 21978 | |
| N | 20423 | |
| I | 20379 | |
| A | 15797 | |
| L | 14111 | |
| O | 7966 | 4.4% |
| R | 7855 | 4.3% |
| P | 7843 | 4.3% |
| Y | 7843 | 4.3% |
| Other values (22) | 33202 |
Common
| Value | Count | Frequency (%) |
| (space) | 22193 | |
| 1 | 7843 | 26.1% |
| - | 12 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 210974 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 23529 | |
| (space) | 22193 | |
| E | 21978 | |
| N | 20423 | |
| I | 20379 | |
| A | 15797 | 7.5% |
| L | 14111 | 6.7% |
| O | 7966 | 3.8% |
| R | 7855 | 3.7% |
| 1 | 7843 | 3.7% |
| Other values (25) | 48900 |
| Distinct | 7513 |
|---|---|
| Distinct (%) | 52.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2814861.658 |
| Minimum | 1000003 |
|---|---|
| Maximum | 5863352 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 1000003 |
|---|---|
| 5-th percentile | 1009431.5 |
| Q1 | 1088634.75 |
| median | 3072409 |
| Q3 | 4051863.5 |
| 95-th percentile | 5068897 |
| Maximum | 5863352 |
| Range | 4863349 |
| Interquartile range (IQR) | 2963228.75 |
Descriptive statistics
| Standard deviation | 1410278.23 |
|---|---|
| Coefficient of variation (CV) | 0.5010115599 |
| Kurtosis | -1.343764029 |
| Mean | 2814861.658 |
| Median Absolute Deviation (MAD) | 1207568.5 |
| Skewness | -0.0166120269 |
| Sum | 4.015681641 × 10¹⁰ |
| Variance | 1.988884686 × 10¹² |
| Monotonicity | Not monotonic |
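The entries in the quantile and descriptive statistics tables above are tied together by a few simple identities. The following is a minimal sketch in plain Python, assuming the rounded values printed in the report and 14266 non-missing observations for this variable; the variable names are illustrative and are not pandas-profiling identifiers:

```python
import math

# Values copied from the quantile and descriptive statistics tables above.
n = 14266
minimum, maximum = 1000003, 5863352
q1, q3 = 1088634.75, 4051863.5
mean = 2814861.658
std = 1410278.23

value_range = maximum - minimum  # "Range"
iqr = q3 - q1                    # "Interquartile range (IQR)"
cv = std / mean                  # "Coefficient of variation (CV)"
variance = std ** 2              # "Variance"
total = mean * n                 # "Sum" (valid because no values are missing)

print(value_range, iqr)
print(f"CV = {cv:.10f}")
print(f"variance = {variance:.9e}")
print(f"sum = {total:.9e}")

# The recomputed values agree with the report up to rounding of the inputs.
assert math.isclose(cv, 0.5010115599, rel_tol=1e-6)
```

The Sum and Variance come out on the order of 10^10 and 10^12 respectively, which is how the scientific-notation entries in the table should be read.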
| Value | Count | Frequency (%) |
| 3335884 | 14 | 0.1% |
| 3000088 | 13 | 0.1% |
| 5863165 | 12 | 0.1% |
| 3000417 | 11 | 0.1% |
| 3000171 | 10 | 0.1% |
| 3185543 | 10 | 0.1% |
| 1080824 | 10 | 0.1% |
| 4003540 | 9 | 0.1% |
| 3396962 | 9 | 0.1% |
| 2129224 | 8 | 0.1% |
| Other values (7503) | 14160 |
| Value | Count | Frequency (%) |
| 1000003 | 5 | |
| 1000005 | 2 | < 0.1% |
| 1000037 | 2 | < 0.1% |
| 1000045 | 4 | |
| 1000057 | 2 | < 0.1% |
| 1000058 | 1 | < 0.1% |
| 1000060 | 6 | |
| 1000809 | 4 | |
| 1000811 | 7 | |
| 1000813 | 3 |
| Value | Count | Frequency (%) |
| 5863352 | 1 | < 0.1% |
| 5863165 | 12 | |
| 5820649 | 1 | < 0.1% |
| 5817209 | 1 | < 0.1% |
| 5816598 | 1 | < 0.1% |
| 5814826 | 1 | < 0.1% |
| 5810325 | 1 | < 0.1% |
| 5175558 | 1 | < 0.1% |
| 5175463 | 1 | < 0.1% |
| 5175352 | 1 | < 0.1% |
BOROUGH
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| MANHATTAN | |
|---|---|
| BROOKLYN | |
| QUEENS | |
| BRONX | |
| STATEN ISLAND |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.988924716 |
| Min length | 5 |
Characters and Unicode
| Total characters | 113970 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MANHATTAN |
|---|---|
| 2nd row | MANHATTAN |
| 3rd row | MANHATTAN |
| 4th row | MANHATTAN |
| 5th row | MANHATTAN |
Common Values
| Value | Count | Frequency (%) |
| MANHATTAN | 4548 | |
| BROOKLYN | 4529 | |
| QUEENS | 3045 | |
| BRONX | 1167 | 8.2% |
| STATEN ISLAND | 977 | 6.8% |
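The length and character totals reported for this variable can be recomputed from the category counts alone. A minimal sketch in plain Python, using only the counts from the Common Values table above (the variable names are illustrative):

```python
# Category counts copied from the "Common Values" table above.
counts = {
    "MANHATTAN": 4548,
    "BROOKLYN": 4529,
    "QUEENS": 3045,
    "BRONX": 1167,
    "STATEN ISLAND": 977,
}

n = sum(counts.values())                                  # number of observations
total_chars = sum(len(v) * c for v, c in counts.items())  # "Total characters"
mean_length = total_chars / n                             # "Mean length"

# Relative frequency of each category, as a percentage of all observations.
freq_pct = {v: 100 * c / n for v, c in counts.items()}

print(n, total_chars, round(mean_length, 9))
for value, pct in freq_pct.items():
    print(f"{value}: {pct:.1f}%")
```

The counts sum to the 14266 observations, the weighted character count matches the reported 113970 total characters, and the percentages for BRONX and STATEN ISLAND round to the 8.2% and 6.8% shown in the table.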
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| manhattan | 4548 | |
| brooklyn | 4529 | |
| queens | 3045 | |
| bronx | 1167 | 7.7% |
| staten | 977 | 6.4% |
| island | 977 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 19791 | |
| A | 15598 | |
| T | 11050 | |
| O | 10225 | |
| E | 7067 | 6.2% |
| B | 5696 | 5.0% |
| R | 5696 | 5.0% |
| L | 5506 | 4.8% |
| S | 4999 | 4.4% |
| M | 4548 | 4.0% |
| Other values (9) | 23794 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 112993 | |
| Space Separator | 977 | 0.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 19791 | |
| A | 15598 | |
| T | 11050 | |
| O | 10225 | |
| E | 7067 | 6.3% |
| B | 5696 | 5.0% |
| R | 5696 | 5.0% |
| L | 5506 | 4.9% |
| S | 4999 | 4.4% |
| M | 4548 | 4.0% |
| Other values (8) | 22817 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 977 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 112993 | |
| Common | 977 | 0.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 19791 | |
| A | 15598 | |
| T | 11050 | |
| O | 10225 | |
| E | 7067 | 6.3% |
| B | 5696 | 5.0% |
| R | 5696 | 5.0% |
| L | 5506 | 4.9% |
| S | 4999 | 4.4% |
| M | 4548 | 4.0% |
| Other values (8) | 22817 |
Common
| Value | Count | Frequency (%) |
| (space) | 977 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 113970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 19791 | |
| A | 15598 | |
| T | 11050 | |
| O | 10225 | |
| E | 7067 | 6.2% |
| B | 5696 | 5.0% |
| R | 5696 | 5.0% |
| L | 5506 | 4.8% |
| S | 4999 | 4.4% |
| M | 4548 | 4.0% |
| Other values (9) | 23794 |
HOUSE NO
Categorical
| Distinct | 3741 |
|---|---|
| Distinct (%) | 26.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| 1 | 125 |
|---|---|
| 11 | 76 |
| 20 | 76 |
| 200 | 64 |
| 100 | 63 |
| Other values (3736) |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 3.523482406 |
| Min length | 1 |
Characters and Unicode
| Total characters | 50266 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1801 |
|---|---|
| Unique (%) | 12.6% |
Sample
| 1st row | 10 |
|---|---|
| 2nd row | 10 |
| 3rd row | 10 |
| 4th row | 10 |
| 5th row | 10 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 125 | 0.9% |
| 11 | 76 | 0.5% |
| 20 | 76 | 0.5% |
| 200 | 64 | 0.4% |
| 100 | 63 | 0.4% |
| 25 | 62 | 0.4% |
| 40 | 58 | 0.4% |
| 55 | 57 | 0.4% |
| 45 | 56 | 0.4% |
| 10 | 55 | 0.4% |
| Other values (3731) | 13574 |
Length
| Value | Count | Frequency (%) |
| 1 | 125 | 0.9% |
| gar | 104 | 0.7% |
| 20 | 77 | 0.5% |
| 11 | 76 | 0.5% |
| 200 | 64 | 0.4% |
| 100 | 63 | 0.4% |
| 25 | 63 | 0.4% |
| 55 | 59 | 0.4% |
| 40 | 58 | 0.4% |
| 45 | 56 | 0.4% |
| Other values (3613) | 13684 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 9163 | |
| 2 | 6046 | |
| 0 | 5326 | |
| 5 | 4993 | |
| 3 | 4755 | |
| 4 | 4212 | |
| 6 | 3277 | 6.5% |
| 7 | 3001 | 6.0% |
| 8 | 2891 | 5.8% |
| - | 2829 | 5.6% |
| Other values (17) | 3773 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46295 | |
| Dash Punctuation | 2829 | 5.6% |
| Uppercase Letter | 968 | 1.9% |
| Space Separator | 163 | 0.3% |
| Other Punctuation | 6 | < 0.1% |
| Lowercase Letter | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9163 | |
| 2 | 6046 | |
| 0 | 5326 | |
| 5 | 4993 | |
| 3 | 4755 | |
| 4 | 4212 | |
| 6 | 3277 | 7.1% |
| 7 | 3001 | 6.5% |
| 8 | 2891 | 6.2% |
| 9 | 2631 | 5.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 393 | |
| G | 274 | |
| R | 237 | |
| E | 41 | 4.2% |
| B | 13 | 1.3% |
| X | 3 | 0.3% |
| C | 3 | 0.3% |
| P | 2 | 0.2% |
| I | 1 | 0.1% |
| D | 1 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| r | 1 | |
| g | 1 | |
| e | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2829 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 163 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49293 | |
| Latin | 973 | 1.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 393 | |
| G | 274 | |
| R | 237 | |
| E | 41 | 4.2% |
| B | 13 | 1.3% |
| X | 3 | 0.3% |
| C | 3 | 0.3% |
| P | 2 | 0.2% |
| a | 2 | 0.2% |
| I | 1 | 0.1% |
| Other values (4) | 4 | 0.4% |
Common
| Value | Count | Frequency (%) |
| 1 | 9163 | |
| 2 | 6046 | |
| 0 | 5326 | |
| 5 | 4993 | |
| 3 | 4755 | |
| 4 | 4212 | |
| 6 | 3277 | 6.6% |
| 7 | 3001 | 6.1% |
| 8 | 2891 | 5.9% |
| - | 2829 | 5.7% |
| Other values (3) | 2800 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50266 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 9163 | |
| 2 | 6046 | |
| 0 | 5326 | |
| 5 | 4993 | |
| 3 | 4755 | |
| 4 | 4212 | |
| 6 | 3277 | 6.5% |
| 7 | 3001 | 6.0% |
| 8 | 2891 | 5.8% |
| - | 2829 | 5.6% |
| Other values (17) | 3773 |
STREET NAME
Categorical
| Distinct | 3301 |
|---|---|
| Distinct (%) | 23.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| BROADWAY | 313 |
|---|---|
| MADISON AVENUE | 106 |
| PARK AVENUE | 81 |
| FIFTH AVENUE | 63 |
| FLATBUSH AVENUE | 62 |
| Other values (3296) |
Length
| Max length | 29 |
|---|---|
| Median length | 23 |
| Mean length | 12.9249264 |
| Min length | 3 |
Characters and Unicode
| Total characters | 184387 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1171 |
|---|---|
| Unique (%) | 8.2% |
Sample
| 1st row | SOUTH STREET |
|---|---|
| 2nd row | SOUTH STREET |
| 3rd row | SOUTH STREET |
| 4th row | SOUTH STREET |
| 5th row | SOUTH STREET |
Common Values
| Value | Count | Frequency (%) |
| BROADWAY | 313 | 2.2% |
| MADISON AVENUE | 106 | 0.7% |
| PARK AVENUE | 81 | 0.6% |
| FIFTH AVENUE | 63 | 0.4% |
| FLATBUSH AVENUE | 62 | 0.4% |
| 5TH AVENUE | 57 | 0.4% |
| SCHROEDERS AVENUE | 55 | 0.4% |
| BEDFORD AVENUE | 50 | 0.4% |
| FULTON STREET | 48 | 0.3% |
| HUDSON STREET | 46 | 0.3% |
| Other values (3291) | 13385 |
Length
| Value | Count | Frequency (%) |
| street | 6175 | 19.5% |
| avenue | 4050 | 12.8% |
| west | 1340 | 4.2% |
| east | 1133 | 3.6% |
| ave | 924 | 2.9% |
| st | 611 | 1.9% |
| road | 449 | 1.4% |
| place | 390 | 1.2% |
| broadway | 344 | 1.1% |
| blvd | 270 | 0.9% |
| Other values (1765) | 16036 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 31435 | |
| T | 21508 | |
| (space) | 18319 | |
| A | 13409 | 7.3% |
| R | 13127 | 7.1% |
| S | 13084 | 7.1% |
| N | 9934 | 5.4% |
| V | 6310 | 3.4% |
| U | 5958 | 3.2% |
| O | 5897 | 3.2% |
| Other values (37) | 45406 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 154682 | |
| Space Separator | 18319 | 9.9% |
| Decimal Number | 11170 | 6.1% |
| Other Punctuation | 202 | 0.1% |
| Lowercase Letter | 14 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 31435 | |
| T | 21508 | |
| A | 13409 | |
| R | 13127 | |
| S | 13084 | |
| N | 9934 | 6.4% |
| V | 6310 | 4.1% |
| U | 5958 | 3.9% |
| O | 5897 | 3.8% |
| H | 4351 | 2.8% |
| Other values (16) | 29669 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2150 | |
| 2 | 1584 | |
| 3 | 1214 | |
| 5 | 1212 | |
| 4 | 1177 | |
| 6 | 936 | |
| 7 | 854 | 7.6% |
| 8 | 786 | 7.0% |
| 9 | 694 | 6.2% |
| 0 | 563 | 5.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4 | |
| n | 3 | |
| e | 2 | |
| u | 1 | 7.1% |
| i | 1 | 7.1% |
| g | 1 | 7.1% |
| o | 1 | 7.1% |
| r | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 184 | |
| ' | 18 | 8.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 18319 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 154696 | |
| Common | 29691 | 16.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 31435 | |
| T | 21508 | |
| A | 13409 | |
| R | 13127 | |
| S | 13084 | |
| N | 9934 | 6.4% |
| V | 6310 | 4.1% |
| U | 5958 | 3.9% |
| O | 5897 | 3.8% |
| H | 4351 | 2.8% |
| Other values (24) | 29683 |
Common
| Value | Count | Frequency (%) |
| (space) | 18319 | |
| 1 | 2150 | 7.2% |
| 2 | 1584 | 5.3% |
| 3 | 1214 | 4.1% |
| 5 | 1212 | 4.1% |
| 4 | 1177 | 4.0% |
| 6 | 936 | 3.2% |
| 7 | 854 | 2.9% |
| 8 | 786 | 2.6% |
| 9 | 694 | 2.3% |
| Other values (3) | 765 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 184387 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 31435 | |
| T | 21508 | |
| (space) | 18319 | |
| A | 13409 | 7.3% |
| R | 13127 | 7.1% |
| S | 13084 | 7.1% |
| N | 9934 | 5.4% |
| V | 6310 | 3.4% |
| U | 5958 | 3.2% |
| O | 5897 | 3.2% |
| Other values (37) | 45406 |
| Distinct | 3987 |
|---|---|
| Distinct (%) | 28.0% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3110.363222 |
| Minimum | 1 |
|---|---|
| Maximum | 99999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 140 |
| Q1 | 837 |
| median | 1886 |
| Q3 | 4543 |
| 95-th percentile | 9903 |
| Maximum | 99999 |
| Range | 99998 |
| Interquartile range (IQR) | 3706 |
Descriptive statistics
| Standard deviation | 3867.934029 |
|---|---|
| Coefficient of variation (CV) | 1.243563453 |
| Kurtosis | 193.3490579 |
| Mean | 3110.363222 |
| Median Absolute Deviation (MAD) | 1327 |
| Skewness | 8.727046272 |
| Sum | 44366221 |
| Variance | 14960913.65 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 54 | 116 | 0.8% |
| 4452 | 96 | 0.7% |
| 1171 | 53 | 0.4% |
| 5295 | 42 | 0.3% |
| 4586 | 38 | 0.3% |
| 972 | 37 | 0.3% |
| 16350 | 36 | 0.3% |
| 2023 | 32 | 0.2% |
| 171 | 31 | 0.2% |
| 1222 | 29 | 0.2% |
| Other values (3977) | 13754 |
| Value | Count | Frequency (%) |
| 1 | 17 | |
| 2 | 9 | |
| 4 | 2 | < 0.1% |
| 6 | 16 | |
| 7 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| 13 | 4 | < 0.1% |
| 15 | 16 | |
| 16 | 21 | |
| 17 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 7 | < 0.1% |
| 16350 | 36 | |
| 16340 | 6 | < 0.1% |
| 16319 | 1 | < 0.1% |
| 16294 | 1 | < 0.1% |
| 16285 | 2 | < 0.1% |
| 16274 | 1 | < 0.1% |
| 16264 | 1 | < 0.1% |
| 16242 | 1 | < 0.1% |
| 16231 | 1 | < 0.1% |
LOT
Real number (ℝ≥0)
| Distinct | 367 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1173.986748 |
| Minimum | 0 |
|---|---|
| Maximum | 9021 |
| Zeros | 24 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 15 |
| median | 38 |
| Q3 | 86 |
| 95-th percentile | 7502 |
| Maximum | 9021 |
| Range | 9021 |
| Interquartile range (IQR) | 71 |
Descriptive statistics
| Standard deviation | 2669.251182 |
|---|---|
| Coefficient of variation (CV) | 2.27366381 |
| Kurtosis | 1.808049689 |
| Mean | 1173.986748 |
| Median Absolute Deviation (MAD) | 28 |
| Skewness | 1.949326048 |
| Sum | 16743399 |
| Variance | 7124901.872 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1349 | 9.5% |
| 7501 | 971 | 6.8% |
| 7502 | 501 | 3.5% |
| 7503 | 255 | 1.8% |
| 10 | 207 | 1.5% |
| 6 | 196 | 1.4% |
| 5 | 186 | 1.3% |
| 7 | 185 | 1.3% |
| 29 | 184 | 1.3% |
| 21 | 184 | 1.3% |
| Other values (357) | 10044 |
| Value | Count | Frequency (%) |
| 0 | 24 | 0.2% |
| 1 | 1349 | |
| 2 | 158 | 1.1% |
| 3 | 143 | 1.0% |
| 4 | 123 | 0.9% |
| 5 | 186 | 1.3% |
| 6 | 196 | 1.4% |
| 7 | 185 | 1.3% |
| 8 | 170 | 1.2% |
| 9 | 142 | 1.0% |
| Value | Count | Frequency (%) |
| 9021 | 1 | < 0.1% |
| 9001 | 5 | < 0.1% |
| 7517 | 1 | < 0.1% |
| 7514 | 2 | < 0.1% |
| 7513 | 6 | < 0.1% |
| 7512 | 14 | |
| 7511 | 18 | |
| 7510 | 8 | 0.1% |
| 7509 | 12 | |
| 7508 | 26 |
| Distinct | 202 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10738.76937 |
| Minimum | 10001 |
|---|---|
| Maximum | 11697 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 10001 |
|---|---|
| 5-th percentile | 10005 |
| Q1 | 10027 |
| median | 11101 |
| Q3 | 11226 |
| 95-th percentile | 11418 |
| Maximum | 11697 |
| Range | 1696 |
| Interquartile range (IQR) | 1199 |
Descriptive statistics
| Standard deviation | 580.6583289 |
|---|---|
| Coefficient of variation (CV) | 0.05407121702 |
| Kurtosis | -1.744492316 |
| Mean | 10738.76937 |
| Median Absolute Deviation (MAD) | 331 |
| Skewness | -0.1747590919 |
| Sum | 153188545 |
| Variance | 337164.0949 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11101 | 447 | 3.1% |
| 11201 | 440 | 3.1% |
| 10013 | 372 | 2.6% |
| 10019 | 271 | 1.9% |
| 10011 | 255 | 1.8% |
| 10001 | 249 | 1.7% |
| 11211 | 248 | 1.7% |
| 10003 | 236 | 1.7% |
| 11217 | 216 | 1.5% |
| 11238 | 215 | 1.5% |
| Other values (192) | 11316 |
| Value | Count | Frequency (%) |
| 10001 | 249 | |
| 10002 | 155 | |
| 10003 | 236 | |
| 10004 | 61 | 0.4% |
| 10005 | 44 | 0.3% |
| 10006 | 18 | 0.1% |
| 10007 | 69 | 0.5% |
| 10009 | 97 | 0.7% |
| 10010 | 162 | |
| 10011 | 255 |
| Value | Count | Frequency (%) |
| 11697 | 42 | |
| 11694 | 27 | 0.2% |
| 11693 | 19 | 0.1% |
| 11692 | 32 | 0.2% |
| 11691 | 78 | |
| 11436 | 19 | 0.1% |
| 11435 | 63 | |
| 11434 | 100 | |
| 11433 | 43 | |
| 11432 | 83 |
SUBMITTED DATE
Categorical
| Distinct | 399 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| 12/08/2021 12:00:00 AM | 92 |
|---|---|
| 05/04/2021 12:00:00 AM | 88 |
| 05/25/2021 12:00:00 AM | 86 |
| 01/11/2022 12:00:00 AM | 83 |
| 05/05/2021 12:00:00 AM | 83 |
| Other values (394) |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 313852 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 28 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 01/25/2022 12:00:00 AM |
|---|---|
| 2nd row | 01/27/2022 12:00:00 AM |
| 3rd row | 05/03/2021 12:00:00 AM |
| 4th row | 08/13/2021 12:00:00 AM |
| 5th row | 11/16/2021 12:00:00 AM |
Common Values
| Value | Count | Frequency (%) |
| 12/08/2021 12:00:00 AM | 92 | 0.6% |
| 05/04/2021 12:00:00 AM | 88 | 0.6% |
| 05/25/2021 12:00:00 AM | 86 | 0.6% |
| 01/11/2022 12:00:00 AM | 83 | 0.6% |
| 05/05/2021 12:00:00 AM | 83 | 0.6% |
| 12/20/2021 12:00:00 AM | 81 | 0.6% |
| 12/21/2021 12:00:00 AM | 77 | 0.5% |
| 03/10/2022 12:00:00 AM | 77 | 0.5% |
| 05/12/2021 12:00:00 AM | 77 | 0.5% |
| 05/10/2021 12:00:00 AM | 77 | 0.5% |
| Other values (389) | 13445 |
Length
| Value | Count | Frequency (%) |
| am | 14266 | |
| 12:00:00 | 14266 | |
| 12/08/2021 | 92 | 0.2% |
| 05/04/2021 | 88 | 0.2% |
| 05/25/2021 | 86 | 0.2% |
| 01/11/2022 | 83 | 0.2% |
| 05/05/2021 | 83 | 0.2% |
| 12/20/2021 | 81 | 0.2% |
| 05/12/2021 | 77 | 0.2% |
| 05/10/2021 | 77 | 0.2% |
| Other values (391) | 13599 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 89168 | |
| 2 | 55016 | |
| 1 | 36198 | |
| / | 28532 | 9.1% |
| (space) | 28532 | 9.1% |
| : | 28532 | 9.1% |
| A | 14266 | 4.5% |
| M | 14266 | 4.5% |
| 3 | 3579 | 1.1% |
| 4 | 3229 | 1.0% |
| Other values (5) | 12534 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 199724 | |
| Other Punctuation | 57064 | 18.2% |
| Space Separator | 28532 | 9.1% |
| Uppercase Letter | 28532 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 89168 | |
| 2 | 55016 | |
| 1 | 36198 | |
| 3 | 3579 | 1.8% |
| 4 | 3229 | 1.6% |
| 5 | 2823 | 1.4% |
| 8 | 2704 | 1.4% |
| 9 | 2446 | 1.2% |
| 7 | 2437 | 1.2% |
| 6 | 2124 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 28532 | |
| : | 28532 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 14266 | |
| M | 14266 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 28532 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 285320 | |
| Latin | 28532 | 9.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 89168 | |
| 2 | 55016 | |
| 1 | 36198 | |
| / | 28532 | 10.0% |
| (space) | 28532 | 10.0% |
| : | 28532 | 10.0% |
| 3 | 3579 | 1.3% |
| 4 | 3229 | 1.1% |
| 5 | 2823 | 1.0% |
| 8 | 2704 | 0.9% |
| Other values (3) | 7007 | 2.5% |
Latin
| Value | Count | Frequency (%) |
| A | 14266 | |
| M | 14266 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 313852 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 89168 | |
| 2 | 55016 | |
| 1 | 36198 | |
| / | 28532 | 9.1% |
| (space) | 28532 | 9.1% |
| : | 28532 | 9.1% |
| A | 14266 | 4.5% |
| M | 14266 | 4.5% |
| 3 | 3579 | 1.1% |
| 4 | 3229 | 1.0% |
| Other values (5) | 12534 | 4.0% |
C OF O STATUS
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| CO Issued |
|---|
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 128394 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CO Issued |
|---|---|
| 2nd row | CO Issued |
| 3rd row | CO Issued |
| 4th row | CO Issued |
| 5th row | CO Issued |
Common Values
| Value | Count | Frequency (%) |
| CO Issued | 14266 | 100.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| co | 14266 | |
| issued | 14266 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 28532 | |
| C | 14266 | |
| O | 14266 | |
| (space) | 14266 | 11.1% |
| I | 14266 | |
| u | 14266 | |
| e | 14266 | |
| d | 14266 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 71330 | |
| Uppercase Letter | 42798 | |
| Space Separator | 14266 | 11.1% |
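The category totals above can be reproduced from the constant value itself. The following is a minimal sketch, assuming the report classifies characters by Unicode general category; the helper name `category_counts` and the `CATEGORY_NAMES` mapping are illustrative and not part of the report.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter Unicode general-category codes seen in this column.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
}


def category_counts(value: str, n_rows: int) -> dict:
    """Count characters per Unicode category for a column holding one constant value."""
    per_value = Counter(unicodedata.category(ch) for ch in value)
    return {CATEGORY_NAMES[code]: count * n_rows for code, count in per_value.items()}


# "CO Issued" repeated over the 14266 observations reported for this dataset.
counts = category_counts("CO Issued", n_rows=14266)
total = sum(counts.values())

for name, count in counts.items():
    print(f"{name}: {count} ({count / total:.1%})")
```

The total matches the 128394 characters listed under "Characters and Unicode" for this variable.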
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 28532 | |
| u | 14266 | |
| e | 14266 | |
| d | 14266 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 14266 | |
| O | 14266 | |
| I | 14266 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 14266 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 114128 | |
| Common | 14266 | 11.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 28532 | |
| C | 14266 | |
| O | 14266 | |
| I | 14266 | |
| u | 14266 | |
| e | 14266 | |
| d | 14266 |
Common
| Value | Count | Frequency (%) |
| (space) | 14266 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128394 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 28532 | |
| C | 14266 | |
| O | 14266 | |
| (space) | 14266 | 11.1% |
| I | 14266 | |
| u | 14266 | |
| e | 14266 | |
| d | 14266 |
C OF O SEQUENCE #
Real number (ℝ≥0)
| Distinct | 14265 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9527.692486 |
| Minimum | 13 |
|---|---|
| Maximum | 19520 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 1348.25 |
| Q1 | 4951.25 |
| median | 9503.5 |
| Q3 | 14006.5 |
| 95-th percentile | 17907.75 |
| Maximum | 19520 |
| Range | 19507 |
| Interquartile range (IQR) | 9055.25 |
Descriptive statistics
| Standard deviation | 5286.289072 |
|---|---|
| Coefficient of variation (CV) | 0.5548341406 |
| Kurtosis | -1.151682768 |
| Mean | 9527.692486 |
| Median Absolute Deviation (MAD) | 4528.5 |
| Skewness | 0.02737456492 |
| Sum | 135922061 |
| Variance | 27944852.16 |
| Monotonicity | Not monotonic |
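Several of the figures above are derived from others in the same tables. As a quick consistency check, the sketch below recomputes the coefficient of variation, interquartile range, range, and variance from the reported mean, standard deviation, quartiles, and extremes for C OF O SEQUENCE #. The variable names are illustrative only.

```python
# Values copied from the report for C OF O SEQUENCE #.
mean = 9527.692486
std = 5286.289072
q1, q3 = 4951.25, 14006.5
minimum, maximum = 13, 19520

cv = std / mean                    # coefficient of variation
iqr = q3 - q1                      # interquartile range
value_range = maximum - minimum    # range
variance = std ** 2                # variance is the squared standard deviation

print(f"CV={cv:.10f} IQR={iqr} range={value_range} variance={variance:.2f}")
```

The results agree with the reported CV (0.5548341406), IQR (9055.25), range (19507), and variance (27944852.16) up to rounding of the inputs.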
| Value | Count | Frequency (%) |
| 16833 | 2 | < 0.1% |
| 2049 | 1 | < 0.1% |
| 5400 | 1 | < 0.1% |
| 9470 | 1 | < 0.1% |
| 17666 | 1 | < 0.1% |
| 5384 | 1 | < 0.1% |
| 7433 | 1 | < 0.1% |
| 1290 | 1 | < 0.1% |
| 3339 | 1 | < 0.1% |
| 13580 | 1 | < 0.1% |
| Other values (14255) | 14255 |
| Value | Count | Frequency (%) |
| 13 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 19520 | 1 | < 0.1% |
| 19519 | 1 | < 0.1% |
| 19515 | 1 | < 0.1% |
| 19494 | 1 | < 0.1% |
| 19493 | 1 | < 0.1% |
| 19490 | 1 | < 0.1% |
| 19483 | 1 | < 0.1% |
| 19480 | 1 | < 0.1% |
| 19472 | 1 | < 0.1% |
| 19471 | 1 | < 0.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 111.6 KiB |
| Renewal Without Change | |
|---|---|
| Final | |
| Initial | |
| Renewal With Change |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 14.57074744 |
| Min length | 5 |
Characters and Unicode
| Total characters | 207808 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Renewal Without Change |
|---|---|
| 2nd row | Renewal With Change |
| 3rd row | Renewal With Change |
| 4th row | Renewal Without Change |
| 5th row | Renewal Without Change |
Common Values
| Value | Count | Frequency (%) |
| Renewal Without Change | 6822 | 47.8% |
| Final | 4180 | 29.3% |
| Initial | 2093 | 14.7% |
| Renewal With Change | 1167 | 8.2% |
| (Missing) | 4 | < 0.1% |
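The frequency column in the table above is each count divided by the number of observations (14266). A minimal sketch of that calculation follows; the `format_share` helper and its "< 0.1%" threshold are assumptions that imitate the report's display, not its actual code.

```python
# Counts copied from the Common Values table above.
counts = {
    "Renewal Without Change": 6822,
    "Final": 4180,
    "Initial": 2093,
    "Renewal With Change": 1167,
    "(Missing)": 4,
}
total = sum(counts.values())  # equals the number of observations


def format_share(count: int, total: int) -> str:
    """Format a share as a percentage; tiny non-zero shares are shown as '< 0.1%' (assumed rule)."""
    share = count / total
    if 0 < share < 0.001:
        return "< 0.1%"
    return f"{share:.1%}"


percentages = {value: format_share(count, total) for value, count in counts.items()}
for value, pct in percentages.items():
    print(f"{value}: {pct}")
```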
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| renewal | 7989 | |
| change | 7989 | |
| without | 6822 | |
| final | 4180 | |
| initial | 2093 | 6.9% |
| with | 1167 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 23967 | |
| n | 22251 | |
| a | 22251 | |
| t | 16904 | 8.1% |
| i | 16355 | 7.9% |
| h | 15978 | 7.7% |
| (space) | 15978 | 7.7% |
| l | 14262 | 6.9% |
| g | 7989 | 3.8% |
| C | 7989 | 3.8% |
| Other values (7) | 43884 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 161590 | |
| Uppercase Letter | 30240 | 14.6% |
| Space Separator | 15978 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 23967 | |
| n | 22251 | |
| a | 22251 | |
| t | 16904 | |
| i | 16355 | |
| h | 15978 | |
| l | 14262 | |
| g | 7989 | 4.9% |
| w | 7989 | 4.9% |
| o | 6822 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 7989 | |
| R | 7989 | |
| W | 7989 | |
| F | 4180 | |
| I | 2093 | 6.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 15978 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 191830 | |
| Common | 15978 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 23967 | |
| n | 22251 | |
| a | 22251 | |
| t | 16904 | |
| i | 16355 | |
| h | 15978 | |
| l | 14262 | 7.4% |
| g | 7989 | 4.2% |
| C | 7989 | 4.2% |
| R | 7989 | 4.2% |
| Other values (6) | 35895 |
Common
| Value | Count | Frequency (%) |
| (space) | 15978 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 207808 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 23967 | |
| n | 22251 | |
| a | 22251 | |
| t | 16904 | 8.1% |
| i | 16355 | 7.9% |
| h | 15978 | 7.7% |
| (space) | 15978 | 7.7% |
| l | 14262 | 6.9% |
| g | 7989 | 3.8% |
| C | 7989 | 3.8% |
| Other values (7) | 43884 |
COMMUNITY BOARD
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 66 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 19 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 268.6559978 |
| Minimum | 1 |
|---|---|
| Maximum | 503 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 102 |
| Q1 | 107 |
| median | 303 |
| Q3 | 402 |
| 95-th percentile | 502 |
| Maximum | 503 |
| Range | 502 |
| Interquartile range (IQR) | 295 |
Descriptive statistics
| Standard deviation | 131.1112884 |
|---|---|
| Coefficient of variation (CV) | 0.4880266567 |
| Kurtosis | -1.249806209 |
| Mean | 268.6559978 |
| Median Absolute Deviation (MAD) | 104 |
| Skewness | 0.01277242913 |
| Sum | 3827542 |
| Variance | 17190.16995 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 1036 | 7.3% |
| 302 | 703 | 4.9% |
| 301 | 683 | 4.8% |
| 102 | 539 | 3.8% |
| 101 | 526 | 3.7% |
| 104 | 520 | 3.6% |
| 402 | 445 | 3.1% |
| 108 | 432 | 3.0% |
| 407 | 431 | 3.0% |
| 303 | 422 | 3.0% |
| Other values (56) | 8510 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
| 4 | 2 | < 0.1% |
| 101 | 526 | 3.7% |
| 102 | 539 | 3.8% |
| 103 | 300 | 2.1% |
| 104 | 520 | 3.6% |
| 105 | 1036 | 7.3% |
| 106 | 313 | 2.2% |
| Value | Count | Frequency (%) |
| 503 | 407 | 2.9% |
| 502 | 311 | 2.2% |
| 501 | 259 | 1.8% |
| 483 | 2 | < 0.1% |
| 481 | 7 | < 0.1% |
| 414 | 184 | 1.3% |
| 413 | 169 | 1.2% |
| 412 | 296 | 2.1% |
| 411 | 246 | 1.7% |
| 410 | 117 | 0.8% |
C OF O ISSUANCE DATE
Categorical
| Distinct | 14258 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| 08/20/21 2:57:36 PM | 2 |
|---|---|
| 10/20/21 3:50:05 PM | 2 |
| 03/15/22 9:19:56 AM | 2 |
| 04/27/22 4:14:34 PM | 2 |
| 03/23/22 12:37:03 PM | 2 |
| Other values (14253) |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Characters and Unicode
| Total characters | 285320 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 14250 |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | 01/26/22 3:49:10 PM |
|---|---|
| 2nd row | 03/17/22 10:17:02 AM |
| 3rd row | 05/27/21 3:51:49 PM |
| 4th row | 08/20/21 3:25:28 PM |
| 5th row | 11/24/21 9:58:25 AM |
Common Values
| Value | Count | Frequency (%) |
| 08/20/21 2:57:36 PM | 2 | < 0.1% |
| 10/20/21 3:50:05 PM | 2 | < 0.1% |
| 03/15/22 9:19:56 AM | 2 | < 0.1% |
| 04/27/22 4:14:34 PM | 2 | < 0.1% |
| 03/23/22 12:37:03 PM | 2 | < 0.1% |
| 05/05/22 11:02:03 AM | 2 | < 0.1% |
| 12/01/21 2:12:15 PM | 2 | < 0.1% |
| 12/21/21 10:43:29 AM | 2 | < 0.1% |
| 05/10/21 11:23:55 AM | 1 | < 0.1% |
| 05/06/22 4:45:52 PM | 1 | < 0.1% |
| Other values (14248) | 14248 |
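These timestamps are profiled as text, which is why the column shows near-unique cardinality. A minimal sketch of parsing them into datetimes follows, assuming the `MM/DD/YY H:MM:SS AM/PM` layout seen in the samples. Every value has a reported length of 20 while some samples show only 19 visible characters, so the raw strings may be space-padded; the helper therefore collapses whitespace first. The names `ISSUANCE_FORMAT` and `parse_issuance` are illustrative.

```python
from datetime import datetime

# Assumed layout, inferred from sample rows such as "01/26/22 3:49:10 PM".
ISSUANCE_FORMAT = "%m/%d/%y %I:%M:%S %p"


def parse_issuance(text: str) -> datetime:
    """Parse an issuance timestamp, tolerating repeated or padding spaces."""
    normalized = " ".join(text.split())
    return datetime.strptime(normalized, ISSUANCE_FORMAT)


print(parse_issuance("01/26/22 3:49:10 PM"))
print(parse_issuance("03/23/22 12:37:03 PM"))
```

Once parsed, the column can be summarized by date or hour rather than by raw string.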
Length
| Value | Count | Frequency (%) |
| pm | 9183 | 21.5% |
| am | 5083 | 11.9% |
| 07/19/21 | 99 | 0.2% |
| 12/21/21 | 99 | 0.2% |
| 10/01/21 | 96 | 0.2% |
| 09/29/21 | 92 | 0.2% |
| 10/05/21 | 87 | 0.2% |
| 01/19/22 | 87 | 0.2% |
| 12/22/21 | 87 | 0.2% |
| 11/09/21 | 86 | 0.2% |
| Other values (11813) | 27799 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 39066 | |
| (space) | 37580 | 13.2% |
| 1 | 37252 | |
| / | 28532 | |
| : | 28532 | |
| 0 | 26631 | |
| M | 14266 | 5.0% |
| 3 | 13251 | 4.6% |
| 4 | 11926 | 4.2% |
| 5 | 10689 | 3.7% |
| Other values (6) | 37595 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 162144 | |
| Other Punctuation | 57064 | 20.0% |
| Space Separator | 37580 | 13.2% |
| Uppercase Letter | 28532 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 39066 | |
| 1 | 37252 | |
| 0 | 26631 | |
| 3 | 13251 | 8.2% |
| 4 | 11926 | 7.4% |
| 5 | 10689 | 6.6% |
| 9 | 6828 | 4.2% |
| 8 | 5902 | 3.6% |
| 7 | 5322 | 3.3% |
| 6 | 5277 | 3.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 14266 | |
| P | 9183 | |
| A | 5083 | 17.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 28532 | |
| : | 28532 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 37580 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 256788 | |
| Latin | 28532 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 39066 | |
| (space) | 37580 | 14.6% |
| 1 | 37252 | |
| / | 28532 | |
| : | 28532 | |
| 0 | 26631 | |
| 3 | 13251 | 5.2% |
| 4 | 11926 | 4.6% |
| 5 | 10689 | 4.2% |
| 9 | 6828 | 2.7% |
| Other values (3) | 16501 |
Latin
| Value | Count | Frequency (%) |
| M | 14266 | |
| P | 9183 | |
| A | 5083 | 17.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 285320 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 39066 | |
| (space) | 37580 | 13.2% |
| 1 | 37252 | |
| / | 28532 | |
| : | 28532 | |
| 0 | 26631 | |
| M | 14266 | 5.0% |
| 3 | 13251 | 4.6% |
| 4 | 11926 | 4.2% |
| 5 | 10689 | 3.7% |
| Other values (6) | 37595 |
APPLICATION NUMBER
Categorical
| Distinct | 14265 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| CO-000016833 | 2 |
|---|---|
| CO-000013781 | 1 |
| CO-000012337 | 1 |
| CO-000010264 | 1 |
| CO-000008973 | 1 |
| Other values (14260) |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 171192 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 14264 |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | CO-000014504 |
|---|---|
| 2nd row | CO-000014626 |
| 3rd row | CO-000002499 |
| 4th row | CO-000006765 |
| 5th row | CO-000011434 |
Common Values
| Value | Count | Frequency (%) |
| CO-000016833 | 2 | < 0.1% |
| CO-000013781 | 1 | < 0.1% |
| CO-000012337 | 1 | < 0.1% |
| CO-000010264 | 1 | < 0.1% |
| CO-000008973 | 1 | < 0.1% |
| CO-000005413 | 1 | < 0.1% |
| CO-000015715 | 1 | < 0.1% |
| CO-000008949 | 1 | < 0.1% |
| CO-000013232 | 1 | < 0.1% |
| CO-000002054 | 1 | < 0.1% |
| Other values (14255) | 14255 |
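Every value here is 12 characters long: a `CO-` prefix followed by nine zero-padded digits. The one repeated value, CO-000016833, has the same numeric part as the one repeated C OF O SEQUENCE # value (16833), which suggests the suffix may be the sequence number. That link is an assumption, not something the report states. A minimal sketch of extracting the suffix, with hypothetical names:

```python
import re

# Pattern inferred from the samples: "CO-" plus exactly nine digits.
APPLICATION_RE = re.compile(r"^CO-(\d{9})$")


def sequence_from_application(application_number: str):
    """Return the numeric suffix as an int, or None if the value does not match the pattern."""
    match = APPLICATION_RE.match(application_number.strip().upper())
    return int(match.group(1)) if match else None


print(sequence_from_application("CO-000016833"))
print(sequence_from_application("co-000003403"))
print(sequence_from_application("not-an-id"))
```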
Length
| Value | Count | Frequency (%) |
| co-000016833 | 2 | < 0.1% |
| co-000003403 | 1 | < 0.1% |
| co-000019205 | 1 | < 0.1% |
| co-000005496 | 1 | < 0.1% |
| co-000003339 | 1 | < 0.1% |
| co-000003955 | 1 | < 0.1% |
| co-000017197 | 1 | < 0.1% |
| co-000007448 | 1 | < 0.1% |
| co-000016385 | 1 | < 0.1% |
| co-000006936 | 1 | < 0.1% |
| Other values (14255) | 14255 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 70138 | |
| C | 14266 | 8.3% |
| O | 14266 | 8.3% |
| - | 14266 | 8.3% |
| 1 | 12536 | 7.3% |
| 3 | 5905 | 3.4% |
| 2 | 5903 | 3.4% |
| 4 | 5881 | 3.4% |
| 6 | 5777 | 3.4% |
| 5 | 5764 | 3.4% |
| Other values (3) | 16490 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 128394 | |
| Uppercase Letter | 28532 | 16.7% |
| Dash Punctuation | 14266 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 70138 | |
| 1 | 12536 | 9.8% |
| 3 | 5905 | 4.6% |
| 2 | 5903 | 4.6% |
| 4 | 5881 | 4.6% |
| 6 | 5777 | 4.5% |
| 5 | 5764 | 4.5% |
| 7 | 5682 | 4.4% |
| 8 | 5642 | 4.4% |
| 9 | 5166 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 14266 | |
| O | 14266 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14266 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 142660 | |
| Latin | 28532 | 16.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 70138 | |
| - | 14266 | 10.0% |
| 1 | 12536 | 8.8% |
| 3 | 5905 | 4.1% |
| 2 | 5903 | 4.1% |
| 4 | 5881 | 4.1% |
| 6 | 5777 | 4.0% |
| 5 | 5764 | 4.0% |
| 7 | 5682 | 4.0% |
| 8 | 5642 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| C | 14266 | |
| O | 14266 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 171192 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 70138 | |
| C | 14266 | 8.3% |
| O | 14266 | 8.3% |
| - | 14266 | 8.3% |
| 1 | 12536 | 7.3% |
| 3 | 5905 | 3.4% |
| 2 | 5903 | 3.4% |
| 4 | 5881 | 3.4% |
| 6 | 5777 | 3.4% |
| 5 | 5764 | 3.4% |
| Other values (3) | 16490 | 9.6% |
xCoordinate
Real number (ℝ≥0)
| Distinct | 6793 |
|---|---|
| Distinct (%) | 48.0% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 998590.134 |
| Minimum | 914661 |
|---|---|
| Maximum | 1066784 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 914661 |
|---|---|
| 5-th percentile | 957653 |
| Q1 | 987029 |
| median | 995560 |
| Q3 | 1009269 |
| 95-th percentile | 1043093.4 |
| Maximum | 1066784 |
| Range | 152123 |
| Interquartile range (IQR) | 22240 |
Descriptive statistics
| Standard deviation | 23487.76873 |
|---|---|
| Coefficient of variation (CV) | 0.02352093009 |
| Kurtosis | 1.560564711 |
| Mean | 998590.134 |
| Median Absolute Deviation (MAD) | 9642 |
| Skewness | -0.08034021147 |
| Sum | 1.412505745 × 10¹⁰ |
| Variance | 551675280.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 988128 | 12 | 0.1% |
| 988319 | 12 | 0.1% |
| 995950 | 11 | 0.1% |
| 987264 | 11 | 0.1% |
| 986953 | 10 | 0.1% |
| 988870 | 10 | 0.1% |
| 987234 | 10 | 0.1% |
| 998946 | 9 | 0.1% |
| 995184 | 9 | 0.1% |
| 1009022 | 9 | 0.1% |
| Other values (6783) | 14042 | |
| (Missing) | 121 | 0.8% |
| Value | Count | Frequency (%) |
| 914661 | 1 | |
| 914973 | 1 | |
| 915201 | 1 | |
| 915345 | 1 | |
| 915359 | 1 | |
| 915675 | 1 | |
| 916211 | 1 | |
| 916265 | 2 | |
| 916268 | 2 | |
| 916289 | 1 |
| Value | Count | Frequency (%) |
| 1066784 | 1 | < 0.1% |
| 1066645 | 1 | < 0.1% |
| 1066505 | 1 | < 0.1% |
| 1066494 | 1 | < 0.1% |
| 1066454 | 1 | < 0.1% |
| 1066183 | 1 | < 0.1% |
| 1065715 | 3 | < 0.1% |
| 1065686 | 1 | < 0.1% |
| 1065527 | 2 | < 0.1% |
| 1065401 | 1 | < 0.1% |
yCoordinate
Real number (ℝ≥0)
| Distinct | 6872 |
|---|---|
| Distinct (%) | 48.6% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200661.1704 |
| Minimum | 121245 |
|---|---|
| Maximum | 271410 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 121245 |
|---|---|
| 5-th percentile | 153589.4 |
| Q1 | 186267 |
| median | 202642 |
| Q3 | 215876 |
| 95-th percentile | 244974.4 |
| Maximum | 271410 |
| Range | 150165 |
| Interquartile range (IQR) | 29609 |
Descriptive statistics
| Standard deviation | 26118.14057 |
|---|---|
| Coefficient of variation (CV) | 0.1301604118 |
| Kurtosis | 0.2426639918 |
| Mean | 200661.1704 |
| Median Absolute Deviation (MAD) | 14566 |
| Skewness | -0.2382758943 |
| Sum | 2838352256 |
| Variance | 682157266.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 193305 | 12 | 0.1% |
| 213876 | 11 | 0.1% |
| 215959 | 11 | 0.1% |
| 191275 | 11 | 0.1% |
| 214687 | 10 | 0.1% |
| 194547 | 10 | 0.1% |
| 197266 | 10 | 0.1% |
| 190555 | 10 | 0.1% |
| 202047 | 10 | 0.1% |
| 157566 | 10 | 0.1% |
| Other values (6862) | 14040 | |
| (Missing) | 121 | 0.8% |
| Value | Count | Frequency (%) |
| 121245 | 1 | |
| 121303 | 2 | |
| 121361 | 1 | |
| 121507 | 1 | |
| 121756 | 2 | |
| 122182 | 1 | |
| 122368 | 1 | |
| 122875 | 1 | |
| 122891 | 1 | |
| 123145 | 1 |
| Value | Count | Frequency (%) |
| 271410 | 1 | |
| 270779 | 1 | |
| 270405 | 2 | |
| 269459 | 1 | |
| 269325 | 2 | |
| 269089 | 1 | |
| 268858 | 1 | |
| 268841 | 1 | |
| 268479 | 1 | |
| 268438 | 1 |
latitude
Real number (ℝ≥0)
| Distinct | 7048 |
|---|---|
| Distinct (%) | 49.8% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.71740246 |
| Minimum | 40.499212 |
|---|---|
| Maximum | 40.91159 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 40.499212 |
|---|---|
| 5-th percentile | 40.588132 |
| Q1 | 40.677929 |
| median | 40.722874 |
| Q3 | 40.759115 |
| 95-th percentile | 40.8389676 |
| Maximum | 40.91159 |
| Range | 0.412378 |
| Interquartile range (IQR) | 0.081186 |
Descriptive statistics
| Standard deviation | 0.07169515298 |
|---|---|
| Coefficient of variation (CV) | 0.00176079879 |
| Kurtosis | 0.2441285577 |
| Mean | 40.71740246 |
| Median Absolute Deviation (MAD) | 0.039987 |
| Skewness | -0.2393437313 |
| Sum | 575947.6578 |
| Variance | 0.00514019496 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.755942 | 14 | 0.1% |
| 40.697254 | 12 | 0.1% |
| 40.691682 | 11 | 0.1% |
| 40.599158 | 10 | 0.1% |
| 40.721248 | 10 | 0.1% |
| 40.756978 | 9 | 0.1% |
| 40.692069 | 9 | 0.1% |
| 40.74699 | 8 | 0.1% |
| 40.695198 | 8 | 0.1% |
| 40.746979 | 8 | 0.1% |
| Other values (7038) | 14046 | |
| (Missing) | 121 | 0.8% |
| Value | Count | Frequency (%) |
| 40.499212 | 1 | |
| 40.499371 | 2 | |
| 40.499532 | 1 | |
| 40.499937 | 1 | |
| 40.50062 | 2 | |
| 40.501782 | 1 | |
| 40.502294 | 1 | |
| 40.503683 | 1 | |
| 40.503727 | 1 | |
| 40.504446 | 1 |
| Value | Count | Frequency (%) |
| 40.91159 | 1 | |
| 40.909804 | 1 | |
| 40.908837 | 2 | |
| 40.906177 | 1 | |
| 40.905815 | 2 | |
| 40.905221 | 1 | |
| 40.90459 | 1 | |
| 40.904543 | 1 | |
| 40.903553 | 1 | |
| 40.903428 | 1 |
longitude
Real number (ℝ)
| Distinct | 7045 |
|---|---|
| Distinct (%) | 49.8% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.94823164 |
| Minimum | -74.250237 |
|---|---|
| Maximum | -73.702122 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 14145 |
| Negative (%) | 99.2% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | -74.250237 |
|---|---|
| 5-th percentile | -74.095761 |
| Q1 | -73.989972 |
| median | -73.959151 |
| Q3 | -73.909658 |
| 95-th percentile | -73.7877684 |
| Maximum | -73.702122 |
| Range | 0.548115 |
| Interquartile range (IQR) | 0.080314 |
Descriptive statistics
| Standard deviation | 0.08468585412 |
|---|---|
| Coefficient of variation (CV) | -0.001145204588 |
| Kurtosis | 1.549373853 |
| Mean | -73.94823164 |
| Median Absolute Deviation (MAD) | 0.034827 |
| Skewness | -0.07613728974 |
| Sum | -1045997.737 |
| Variance | 0.007171693888 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -73.986015 | 12 | 0.1% |
| -73.949507 | 12 | 0.1% |
| -73.989132 | 11 | 0.1% |
| -73.979965 | 10 | 0.1% |
| -73.989255 | 10 | 0.1% |
| -73.983324 | 10 | 0.1% |
| -73.986587 | 10 | 0.1% |
| -73.985798 | 10 | 0.1% |
| -73.983135 | 9 | 0.1% |
| -73.949006 | 9 | 0.1% |
| Other values (7035) | 14042 | |
| (Missing) | 121 | 0.8% |
| Value | Count | Frequency (%) |
| -74.250237 | 1 | |
| -74.249125 | 1 | |
| -74.248307 | 1 | |
| -74.247805 | 1 | |
| -74.247755 | 1 | |
| -74.246593 | 1 | |
| -74.244665 | 1 | |
| -74.244484 | 2 | |
| -74.24448 | 2 | |
| -74.244404 | 1 |
| Value | Count | Frequency (%) |
| -73.702122 | 1 | < 0.1% |
| -73.702641 | 1 | < 0.1% |
| -73.703168 | 1 | < 0.1% |
| -73.703175 | 1 | < 0.1% |
| -73.703322 | 1 | < 0.1% |
| -73.704312 | 1 | < 0.1% |
| -73.705961 | 3 | |
| -73.706166 | 1 | < 0.1% |
| -73.706698 | 2 | |
| -73.70717 | 1 | < 0.1% |
communityDistrict
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 62 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 267.9300813 |
| Minimum | 101 |
|---|---|
| Maximum | 503 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 101 |
|---|---|
| 5-th percentile | 102 |
| Q1 | 107 |
| median | 302 |
| Q3 | 402 |
| 95-th percentile | 501 |
| Maximum | 503 |
| Range | 402 |
| Interquartile range (IQR) | 295 |
Descriptive statistics
| Standard deviation | 130.320782 |
|---|---|
| Coefficient of variation (CV) | 0.4863984713 |
| Kurtosis | -1.25653878 |
| Mean | 267.9300813 |
| Median Absolute Deviation (MAD) | 105 |
| Skewness | 0.01056934867 |
| Sum | 3789871 |
| Variance | 16983.50621 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 1035 | 7.3% |
| 302 | 703 | 4.9% |
| 301 | 679 | 4.8% |
| 102 | 540 | 3.8% |
| 101 | 522 | 3.7% |
| 104 | 519 | 3.6% |
| 402 | 443 | 3.1% |
| 407 | 435 | 3.0% |
| 108 | 431 | 3.0% |
| 303 | 422 | 3.0% |
| Other values (52) | 8416 |
| Value | Count | Frequency (%) |
| 101 | 522 | 3.7% |
| 102 | 540 | 3.8% |
| 103 | 293 | 2.1% |
| 104 | 519 | 3.6% |
| 105 | 1035 | 7.3% |
| 106 | 312 | 2.2% |
| 107 | 376 | 2.6% |
| 108 | 431 | 3.0% |
| 109 | 106 | 0.7% |
| 110 | 202 | 1.4% |
| Value | Count | Frequency (%) |
| 503 | 371 | 2.6% |
| 502 | 296 | 2.1% |
| 501 | 242 | 1.7% |
| 483 | 3 | < 0.1% |
| 481 | 7 | < 0.1% |
| 414 | 198 | 1.4% |
| 413 | 169 | 1.2% |
| 412 | 296 | 2.1% |
| 411 | 246 | 1.7% |
| 410 | 117 | 0.8% |
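The communityDistrict values (101 to 503) appear to combine a borough digit with a two-digit district number, which would explain the strong correlation with communityDistrictBoroughCode. The sketch below splits a code on that assumption. The borough names follow the usual New York City numbering (1 to 5) and do not come from this report; the function names are hypothetical.

```python
# Standard NYC borough numbering; not taken from the report itself.
BOROUGH_NAMES = {
    1: "Manhattan",
    2: "Bronx",
    3: "Brooklyn",
    4: "Queens",
    5: "Staten Island",
}


def split_community_district(code: int):
    """Split a code such as 105 into (borough_code, district_number) = (1, 5)."""
    return divmod(int(code), 100)


def borough_name(code: int):
    """Return the borough name, or None when the leading digit is not 1 to 5."""
    borough, _ = split_community_district(code)
    return BOROUGH_NAMES.get(borough)


print(split_community_district(105), borough_name(105))
print(split_community_district(481), borough_name(481))
```

Codes below 100, such as the values 1 to 4 that appear in COMMUNITY BOARD, have no borough digit under this scheme and return `None` from `borough_name`.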
communityDistrictBoroughCode
Categorical
HIGH CORRELATION
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Memory size | 111.6 KiB |
| 1.0 | |
|---|---|
| 3.0 | |
| 4.0 | |
| 2.0 | |
| 5.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 42435 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 4528 | 31.7% |
| 3.0 | 4512 | 31.6% |
| 4.0 | 3030 | 21.2% |
| 2.0 | 1166 | 8.2% |
| 5.0 | 909 | 6.4% |
| (Missing) | 121 | 0.8% |
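The borough codes are stored as text such as "1.0", which usually happens when an integer column with missing entries is read as floating point and then treated as a category. That explanation is an assumption. A minimal sketch for converting the values back to integers while preserving missing entries follows; `borough_code_to_int` is a hypothetical helper.

```python
def borough_code_to_int(raw):
    """Convert a value like '1.0' to 1; return None for missing or blank input."""
    if raw is None or str(raw).strip() == "":
        return None
    return int(float(raw))


# The five observed categories plus a missing entry.
samples = ["1.0", "3.0", "4.0", "2.0", "5.0", None]
converted = [borough_code_to_int(value) for value in samples]
print(converted)
```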
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1.0 | 4528 | |
| 3.0 | 4512 | |
| 4.0 | 3030 | |
| 2.0 | 1166 | 8.2% |
| 5.0 | 909 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 14145 | |
| 0 | 14145 | |
| 1 | 4528 | 10.7% |
| 3 | 4512 | 10.6% |
| 4 | 3030 | 7.1% |
| 2 | 1166 | 2.7% |
| 5 | 909 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28290 | |
| Other Punctuation | 14145 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 14145 | |
| 1 | 4528 | 16.0% |
| 3 | 4512 | 15.9% |
| 4 | 3030 | 10.7% |
| 2 | 1166 | 4.1% |
| 5 | 909 | 3.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 14145 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42435 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 14145 | |
| 0 | 14145 | |
| 1 | 4528 | 10.7% |
| 3 | 4512 | 10.6% |
| 4 | 3030 | 7.1% |
| 2 | 1166 | 2.7% |
| 5 | 909 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42435 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 14145 | |
| 0 | 14145 | |
| 1 | 4528 | 10.7% |
| 3 | 4512 | 10.6% |
| 4 | 3030 | 7.1% |
| 2 | 1166 | 2.7% |
| 5 | 909 | 2.1% |
communityDistrictNumber
Real number (ℝ≥0)
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.922304701 |
| Minimum | 1 |
|---|---|
| Maximum | 83 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 14 |
| Maximum | 83 |
| Range | 82 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.662231008 |
|---|---|
| Coefficient of variation (CV) | 0.7872325459 |
| Kurtosis | 49.61157375 |
| Mean | 5.922304701 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 3.71348615 |
| Sum | 83771 |
| Variance | 21.73639797 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 2047 | |
| 1 | 1957 | |
| 5 | 1578 | |
| 3 | 1268 | |
| 7 | 1100 | |
| 4 | 1017 | |
| 6 | 895 | |
| 8 | 881 | |
| 12 | 802 | 5.6% |
| 11 | 573 | 4.0% |
| Other values (11) | 2027 |
| Value | Count | Frequency (%) |
| 1 | 1957 | |
| 2 | 2047 | |
| 3 | 1268 | |
| 4 | 1017 | |
| 5 | 1578 | |
| 6 | 895 | |
| 7 | 1100 | |
| 8 | 881 | |
| 9 | 397 | 2.8% |
| 10 | 517 | 3.6% |
| Value | Count | Frequency (%) |
| 83 | 3 | < 0.1% |
| 81 | 7 | < 0.1% |
| 55 | 2 | < 0.1% |
| 18 | 96 | 0.7% |
| 17 | 118 | 0.8% |
| 16 | 99 | 0.7% |
| 15 | 158 | 1.1% |
| 14 | 366 | |
| 13 | 264 | 1.9% |
| 12 | 802 |
cityCouncilDistrict
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.05146695 |
| Minimum | 1 |
|---|---|
| Maximum | 51 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 25 |
| Q3 | 36 |
| 95-th percentile | 50 |
| Maximum | 51 |
| Range | 50 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 16.10849574 |
|---|---|
| Coefficient of variation (CV) | 0.698805667 |
| Kurtosis | -1.353713911 |
| Mean | 23.05146695 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 0.04533145544 |
| Sum | 326063 |
| Variance | 259.4836349 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1035 | 7.3% |
| 4 | 997 | 7.0% |
| 33 | 935 | 6.6% |
| 1 | 895 | 6.3% |
| 26 | 553 | 3.9% |
| 2 | 485 | 3.4% |
| 34 | 472 | 3.3% |
| 39 | 416 | 2.9% |
| 35 | 404 | 2.8% |
| 36 | 395 | 2.8% |
| Other values (41) | 7558 |
| Value | Count | Frequency (%) |
| 1 | 895 | 6.3% |
| 2 | 485 | 3.4% |
| 3 | 1035 | 7.3% |
| 4 | 997 | 7.0% |
| 5 | 237 | 1.7% |
| 6 | 339 | 2.4% |
| 7 | 128 | 0.9% |
| 8 | 221 | 1.5% |
| 9 | 267 | 1.9% |
| 10 | 72 | 0.5% |
| Value | Count | Frequency (%) |
| 51 | 361 | 2.5% |
| 50 | 351 | 2.5% |
| 49 | 197 | 1.4% |
| 48 | 167 | 1.2% |
| 47 | 149 | 1.0% |
| 46 | 60 | 0.4% |
| 45 | 125 | 0.9% |
| 44 | 214 | 1.5% |
| 43 | 115 | 0.8% |
| 42 | 265 |
censusTract2010
Real number (ℝ≥0)
| Distinct | 1076 |
|---|---|
| Distinct (%) | 7.6% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6140.341534 |
| Minimum | 1 |
|---|---|
| Maximum | 157903 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 91 |
| median | 235 |
| Q3 | 737 |
| 95-th percentile | 36902 |
| Maximum | 157903 |
| Range | 157902 |
| Interquartile range (IQR) | 646 |
Descriptive statistics
| Standard deviation | 20737.84482 |
|---|---|
| Coefficient of variation (CV) | 3.377311296 |
| Kurtosis | 25.3592508 |
| Mean | 6140.341534 |
| Median Absolute Deviation (MAD) | 190 |
| Skewness | 4.809769071 |
| Sum | 86855131 |
| Variance | 430058207.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 208 | 1.5% |
| 21 | 176 | 1.2% |
| 33 | 172 | 1.2% |
| 37 | 155 | 1.1% |
| 99 | 136 | 1.0% |
| 1070 | 119 | 0.8% |
| 19 | 114 | 0.8% |
| 39 | 99 | 0.7% |
| 41 | 93 | 0.7% |
| 137 | 89 | 0.6% |
| Other values (1066) | 12784 | |
| (Missing) | 121 | 0.8% |
| Value | Count | Frequency (%) |
| 1 | 79 | 0.6% |
| 2 | 4 | < 0.1% |
| 3 | 5 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 11 | 0.1% |
| 6 | 15 | 0.1% |
| 7 | 208 | 1.5% |
| 8 | 30 | 0.2% |
| 9 | 73 | 0.5% |
| 11 | 33 | 0.2% |
| Value | Count | Frequency (%) |
| 157903 | 8 | 0.1% |
| 157902 | 2 | < 0.1% |
| 157901 | 11 | 0.1% |
| 157102 | 3 | < 0.1% |
| 157101 | 5 | < 0.1% |
| 155102 | 12 | 0.1% |
| 152902 | 15 | 0.1% |
| 152901 | 4 | < 0.1% |
| 150702 | 18 | 0.1% |
| 150701 | 10 | 0.1% |
buildingIdentificationNumber
Real number (ℝ≥0)
HIGH CORRELATION
MISSING
| Distinct | 6140 |
|---|---|
| Distinct (%) | 45.2% |
| Missing | 672 |
| Missing (%) | 4.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2692485.571 |
| Minimum | 1000000 |
|---|---|
| Maximum | 5174460 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 1000000 |
|---|---|
| 5-th percentile | 1005457.05 |
| Q1 | 1082882 |
| median | 3028784.5 |
| Q3 | 4000000 |
| 95-th percentile | 4622970.35 |
| Maximum | 5174460 |
| Range | 4174460 |
| Interquartile range (IQR) | 2917118 |
Descriptive statistics
| Standard deviation | 1346639.023 |
|---|---|
| Coefficient of variation (CV) | 0.5001471641 |
| Kurtosis | -1.348435421 |
| Mean | 2692485.571 |
| Median Absolute Deviation (MAD) | 1100558 |
| Skewness | 0.01078959736 |
| Sum | 3.660164885 × 10¹⁰ |
| Variance | 1.813436657 × 10¹² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3000000 | 605 | 4.2% |
| 4000000 | 453 | 3.2% |
| 1000000 | 229 | 1.6% |
| 2000000 | 187 | 1.3% |
| 5000000 | 166 | 1.2% |
| 3335884 | 14 | 0.1% |
| 3000088 | 13 | 0.1% |
| 3000417 | 11 | 0.1% |
| 2128425 | 10 | 0.1% |
| 3000171 | 10 | 0.1% |
| Other values (6130) | 11896 | |
| (Missing) | 672 | 4.7% |
| Value | Count | Frequency (%) |
| 1000000 | 229 | 1.6% |
| 1000003 | 5 | < 0.1% |
| 1000005 | 2 | < 0.1% |
| 1000037 | 2 | < 0.1% |
| 1000045 | 4 | < 0.1% |
| 1000057 | 2 | < 0.1% |
| 1000058 | 1 | < 0.1% |
| 1000060 | 6 | < 0.1% |
| 1000797 | 4 | < 0.1% |
| 1000809 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 5174460 | 1 | < 0.1% |
| 5174442 | 3 | < 0.1% |
| 5171746 | 1 | < 0.1% |
| 5171745 | 1 | < 0.1% |
| 5171743 | 1 | < 0.1% |
| 5171480 | 1 | < 0.1% |
| 5171479 | 1 | < 0.1% |
| 5171389 | 1 | < 0.1% |
| 5171388 | 1 | < 0.1% |
| 5171133 | 1 | < 0.1% |
bbl
| Distinct | 6719 |
|---|---|
| Distinct (%) | 49.4% |
| Missing | 672 |
| Missing (%) | 4.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2594417254 |
| Minimum | 0 |
|---|---|
| Maximum | 5080460194 |
| Zeros | 25 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 111.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1003507503 |
| Q1 | 1013030014 |
| median | 3012750007 |
| Q3 | 4000823128 |
| 95-th percentile | 4163500400 |
| Maximum | 5080460194 |
| Range | 5080460194 |
| Interquartile range (IQR) | 2987793114 |
Descriptive statistics
| Standard deviation | 1298947155 |
|---|---|
| Coefficient of variation (CV) | 0.5006701036 |
| Kurtosis | -1.264114282 |
| Mean | 2594417254 |
| Median Absolute Deviation (MAD) | 1028940018 |
| Skewness | 0.02456380133 |
| Sum | 3.526850815 × 10¹³ |
| Variance | 1.687263712 × 10¹⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3044520400 | 50 | 0.4% |
| 1009720001 | 37 | 0.3% |
| 4163500400 | 29 | 0.2% |
| 0 | 25 | 0.2% |
| 3020230050 | 17 | 0.1% |
| 3020230001 | 15 | 0.1% |
| 3001180006 | 14 | 0.1% |
| 4038100350 | 13 | 0.1% |
| 3000380001 | 13 | 0.1% |
| 3018087503 | 12 | 0.1% |
| Other values (6709) | 13369 | |
| (Missing) | 672 | 4.7% |
| Value | Count | Frequency (%) |
| 0 | 25 | |
| 1000010010 | 11 | |
| 1000020002 | 5 | < 0.1% |
| 1000047501 | 2 | < 0.1% |
| 1000110017 | 2 | < 0.1% |
| 1000130027 | 4 | < 0.1% |
| 1000157501 | 8 | 0.1% |
| 1000157502 | 3 | < 0.1% |
| 1000160120 | 2 | < 0.1% |
| 1000160125 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5080460194 | 1 | |
| 5080460094 | 1 | |
| 5080460090 | 2 | |
| 5080260118 | 1 | |
| 5080210019 | 1 | |
| 5079300008 | 1 | |
| 5079290015 | 1 | |
| 5079280085 | 1 | |
| 5079120083 | 1 | |
| 5079110028 | 1 |
nta
| Distinct | 193 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Memory size | 111.6 KiB |
| MN17 | 700 |
|---|---|
| MN13 | 539 |
| MN24 | 535 |
| QN31 | 404 |
| BK38 | 371 |
| Other values (188) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 56580 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MN25 |
|---|---|
| 2nd row | MN25 |
| 3rd row | MN25 |
| 4th row | MN25 |
| 5th row | MN25 |
Common Values
| Value | Count | Frequency (%) |
| MN17 | 700 | 4.9% |
| MN13 | 539 | 3.8% |
| MN24 | 535 | 3.8% |
| QN31 | 404 | 2.8% |
| BK38 | 371 | 2.6% |
| BK73 | 291 | 2.0% |
| MN23 | 276 | 1.9% |
| BK37 | 256 | 1.8% |
| MN25 | 240 | 1.7% |
| MN12 | 219 | 1.5% |
| Other values (183) | 10314 |
Length
| Value | Count | Frequency (%) |
| mn17 | 700 | 4.9% |
| mn13 | 539 | 3.8% |
| mn24 | 535 | 3.8% |
| qn31 | 404 | 2.9% |
| bk38 | 371 | 2.6% |
| bk73 | 291 | 2.1% |
| mn23 | 276 | 2.0% |
| bk37 | 256 | 1.8% |
| mn25 | 240 | 1.7% |
| mn12 | 219 | 1.5% |
| Other values (183) | 10314 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 7564 | |
| B | 5672 | |
| M | 4534 | 8.0% |
| 1 | 4534 | 8.0% |
| K | 4512 | 8.0% |
| 3 | 4451 | 7.9% |
| 2 | 4208 | 7.4% |
| 7 | 3109 | 5.5% |
| Q | 3030 | 5.4% |
| 4 | 2605 | 4.6% |
| Other values (8) | 12361 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 28290 | |
| Decimal Number | 28290 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4534 | |
| 3 | 4451 | |
| 2 | 4208 | |
| 7 | 3109 | |
| 4 | 2605 | |
| 5 | 2205 | |
| 0 | 2040 | |
| 8 | 1995 | |
| 6 | 1792 | 6.3% |
| 9 | 1351 | 4.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 7564 | |
| B | 5672 | |
| M | 4534 | |
| K | 4512 | |
| Q | 3030 | |
| X | 1160 | 4.1% |
| S | 909 | 3.2% |
| I | 909 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28290 | |
| Common | 28290 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 4534 | |
| 3 | 4451 | |
| 2 | 4208 | |
| 7 | 3109 | |
| 4 | 2605 | |
| 5 | 2205 | |
| 0 | 2040 | |
| 8 | 1995 | |
| 6 | 1792 | 6.3% |
| 9 | 1351 | 4.8% |
Latin
| Value | Count | Frequency (%) |
| N | 7564 | |
| B | 5672 | |
| M | 4534 | |
| K | 4512 | |
| Q | 3030 | |
| X | 1160 | 4.1% |
| S | 909 | 3.2% |
| I | 909 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56580 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 7564 | |
| B | 5672 | |
| M | 4534 | 8.0% |
| 1 | 4534 | 8.0% |
| K | 4512 | 8.0% |
| 3 | 4451 | 7.9% |
| 2 | 4208 | 7.4% |
| 7 | 3109 | 5.5% |
| Q | 3030 | 5.4% |
| 4 | 2605 | 4.6% |
| Other values (8) | 12361 |
ntaName
| Distinct | 193 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 121 |
| Missing (%) | 0.8% |
| Memory size | 111.6 KiB |
| Midtown-Midtown South | 700 |
|---|---|
| Hudson Yards-Chelsea-Flatiron-Union Square | 539 |
| SoHo-TriBeCa-Civic Center-Little Italy | 535 |
| Hunters Point-Sunnyside-West Maspeth | 404 |
| DUMBO-Vinegar Hill-Downtown Brooklyn-Boerum Hill | 371 |
| Other values (188) |
Length
| Max length | 56 |
|---|---|
| Median length | 39 |
| Mean length | 21.61102863 |
| Min length | 6 |
Characters and Unicode
| Total characters | 305688 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Battery Park City-Lower Manhattan |
|---|---|
| 2nd row | Battery Park City-Lower Manhattan |
| 3rd row | Battery Park City-Lower Manhattan |
| 4th row | Battery Park City-Lower Manhattan |
| 5th row | Battery Park City-Lower Manhattan |
Common Values
| Value | Count | Frequency (%) |
| Midtown-Midtown South | 700 | 4.9% |
| Hudson Yards-Chelsea-Flatiron-Union Square | 539 | 3.8% |
| SoHo-TriBeCa-Civic Center-Little Italy | 535 | 3.8% |
| Hunters Point-Sunnyside-West Maspeth | 404 | 2.8% |
| DUMBO-Vinegar Hill-Downtown Brooklyn-Boerum Hill | 371 | 2.6% |
| North Side-South Side | 291 | 2.0% |
| West Village | 276 | 1.9% |
| Park Slope-Gowanus | 256 | 1.8% |
| Battery Park City-Lower Manhattan | 240 | 1.7% |
| Upper West Side | 219 | 1.5% |
| Other values (183) | 10314 |
Length
| Value | Count | Frequency (%) |
| east | 1311 | 3.9% |
| south | 1264 | 3.8% |
| park | 1166 | 3.5% |
| hill | 1067 | 3.2% |
| north | 878 | 2.6% |
| west | 761 | 2.3% |
| heights | 735 | 2.2% |
| square | 720 | 2.2% |
| midtown-midtown | 700 | 2.1% |
| village | 594 | 1.8% |
| Other values (255) | 24153 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 24759 | 8.1% |
| o | 21067 | 6.9% |
| t | 20707 | 6.8% |
| (space) | 19204 | 6.3% |
| a | 18899 | 6.2% |
| n | 18256 | 6.0% |
| i | 18222 | 6.0% |
| r | 17834 | 5.8% |
| l | 16341 | 5.3% |
| s | 13837 | 4.5% |
| Other values (45) | 116562 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 225897 | |
| Uppercase Letter | 48180 | 15.8% |
| Space Separator | 19204 | 6.3% |
| Dash Punctuation | 11994 | 3.9% |
| Other Punctuation | 277 | 0.1% |
| Open Punctuation | 68 | < 0.1% |
| Close Punctuation | 68 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 24759 | |
| o | 21067 | |
| t | 20707 | |
| a | 18899 | 8.4% |
| n | 18256 | 8.1% |
| i | 18222 | 8.1% |
| r | 17834 | 7.9% |
| l | 16341 | 7.2% |
| s | 13837 | 6.1% |
| d | 8641 | 3.8% |
| Other values (15) | 47334 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 5904 | |
| S | 5840 | |
| B | 5043 | 10.5% |
| C | 4832 | 10.0% |
| M | 3923 | 8.1% |
| P | 2722 | 5.6% |
| E | 1998 | 4.1% |
| W | 1935 | 4.0% |
| N | 1674 | 3.5% |
| L | 1559 | 3.2% |
| Other values (14) | 12750 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 160 | |
| ' | 117 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 19204 | |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11994 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 68 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 68 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 274077 | |
| Common | 31611 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 24759 | 9.0% |
| o | 21067 | 7.7% |
| t | 20707 | 7.6% |
| a | 18899 | 6.9% |
| n | 18256 | 6.7% |
| i | 18222 | 6.6% |
| r | 17834 | 6.5% |
| l | 16341 | 6.0% |
| s | 13837 | 5.0% |
| d | 8641 | 3.2% |
| Other values (39) | 95514 |
Common
| Value | Count | Frequency (%) |
| (space) | 19204 | |
| - | 11994 | |
| . | 160 | 0.5% |
| ' | 117 | 0.4% |
| ( | 68 | 0.2% |
| ) | 68 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 305688 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 24759 | 8.1% |
| o | 21067 | 6.9% |
| t | 20707 | 6.8% |
| (space) | 19204 | 6.3% |
| a | 18899 | 6.2% |
| n | 18256 | 6.0% |
| i | 18222 | 6.0% |
| r | 17834 | 5.8% |
| l | 16341 | 5.3% |
| s | 13837 | 4.5% |
| Other values (45) | 116562 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and 1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
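
The calculation described above can be sketched in plain Python. This is an illustrative sketch and not the report generator's own code; the helper names `average_ranks`, `pearson`, and `spearman` are hypothetical, and ties are assumed to receive their average rank.

```python
from statistics import mean

def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def spearman(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    return pearson(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]  # y = x**3: nonlinear but monotonic
print(spearman(x, y), pearson(x, y))
```

On this toy data the rank correlation is 1 (up to floating-point rounding) while Pearson's r is below 1, which is the nonlinear monotonic case the paragraph refers to.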
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
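
A minimal sketch of this definition, including a check of the invariance under location and scale changes. The function name `pearson_r` and the toy data are illustrative assumptions, and the scale factors are taken to be positive.

```python
from statistics import mean

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    # The 1/n factors of covariance and standard deviations cancel out.
    return cov / (sx * sy)

x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
r = pearson_r(x, y)

# Shift and rescale each variable separately (positive scale factors).
x_shifted = [2 * a + 3 for a in x]
y_shifted = [0.5 * b - 1 for b in y]
r_shifted = pearson_r(x_shifted, y_shifted)

print(r, r_shifted)
```

Both calls give the same coefficient up to floating-point rounding, since the shifts cancel in the deviations from the mean and the scale factors cancel between numerator and denominator.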
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and 1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
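
The pair-counting definition above can be sketched directly. This is the simple variant without a tie correction (often called tau-a); the function name `kendall_tau_a` is hypothetical, and the tool that produced this report may use a tie-adjusted variant instead.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair in opposite ways
        # sign == 0 means a tie in x or y; it counts toward neither group
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs

x = [1, 2, 3, 4, 5]
y = [2, 1, 4, 3, 5]
tau = kendall_tau_a(x, y)
print(tau)
```

For this toy data there are 10 pairs, of which 8 are concordant and 2 discordant, so τ = (8 - 2) / 10.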
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | JOB FILING NAME | JOB TYPE | BIN | BOROUGH | HOUSE NO | STREET NAME | BLOCK | LOT | ZIP CODE | SUBMITTED DATE | C OF O STATUS | C OF O SEQUENCE # | C OF O FILING TYPE | COMMUNITY BOARD | C OF O ISSUANCE DATE | APPLICATION NUMBER | xCoordinate | yCoordinate | latitude | longitude | communityDistrict | communityDistrictBoroughCode | communityDistrictNumber | cityCouncilDistrict | censusTract2010 | buildingIdentificationNumber | bbl | nta | ntaName |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 01 | ALTERATION TYPE 1 | 1000003 | MANHATTAN | 10 | SOUTH STREET | 2.0 | 2.0 | 10004.0 | 01/25/2022 12:00:00 AM | CO Issued | 14504 | Renewal Without Change | 101.0 | 01/26/22 3:49:10 PM | CO-000014504 | 981025.0 | 194923.0 | 40.701695 | -74.011631 | 101.0 | 1.0 | 1.0 | 1.0 | 9.0 | 1000003.0 | 1.000020e+09 | MN25 | Battery Park City-Lower Manhattan |
| 1 | 01 | ALTERATION TYPE 1 | 1000003 | MANHATTAN | 10 | SOUTH STREET | 2.0 | 2.0 | 10004.0 | 01/27/2022 12:00:00 AM | CO Issued | 14626 | Renewal With Change | 101.0 | 03/17/22 10:17:02 AM | CO-000014626 | 981025.0 | 194923.0 | 40.701695 | -74.011631 | 101.0 | 1.0 | 1.0 | 1.0 | 9.0 | 1000003.0 | 1.000020e+09 | MN25 | Battery Park City-Lower Manhattan |
| 2 | 01 | ALTERATION TYPE 1 | 1000003 | MANHATTAN | 10 | SOUTH STREET | 2.0 | 2.0 | 10004.0 | 05/03/2021 12:00:00 AM | CO Issued | 2499 | Renewal With Change | 101.0 | 05/27/21 3:51:49 PM | CO-000002499 | 981025.0 | 194923.0 | 40.701695 | -74.011631 | 101.0 | 1.0 | 1.0 | 1.0 | 9.0 | 1000003.0 | 1.000020e+09 | MN25 | Battery Park City-Lower Manhattan |
| 3 | 01 | ALTERATION TYPE 1 | 1000003 | MANHATTAN | 10 | SOUTH STREET | 2.0 | 2.0 | 10004.0 | 08/13/2021 12:00:00 AM | CO Issued | 6765 | Renewal Without Change | 101.0 | 08/20/21 3:25:28 PM | CO-000006765 | 981025.0 | 194923.0 | 40.701695 | -74.011631 | 101.0 | 1.0 | 1.0 | 1.0 | 9.0 | 1000003.0 | 1.000020e+09 | MN25 | Battery Park City-Lower Manhattan |
| 4 | 01 | ALTERATION TYPE 1 | 1000003 | MANHATTAN | 10 | SOUTH STREET | 2.0 | 2.0 | 10004.0 | 11/16/2021 12:00:00 AM | CO Issued | 11434 | Renewal Without Change | 101.0 | 11/24/21 9:58:25 AM | CO-000011434 | 981025.0 | 194923.0 | 40.701695 | -74.011631 | 101.0 | 1.0 | 1.0 | 1.0 | 9.0 | 1000003.0 | 1.000020e+09 | MN25 | Battery Park City-Lower Manhattan |
| 5 | 01 | ALTERATION TYPE 1 | 1000005 | MANHATTAN | 1 | NEW YORK PLAZA | 4.0 | 7501.0 | 10004.0 | 04/13/2021 12:00:00 AM | CO Issued | 1679 | Final | 101.0 | 08/27/21 10:03:44 AM | CO-000001679 | 980767.0 | 195231.0 | 40.702540 | -74.012562 | 101.0 | 1.0 | 1.0 | 1.0 | 9.0 | 1000005.0 | 1.000048e+09 | MN25 | Battery Park City-Lower Manhattan |
| 6 | 01 | ALTERATION TYPE 1 | 1000005 | MANHATTAN | 1 | NEW YORK PLAZA | 4.0 | 7501.0 | 10004.0 | 09/13/2021 12:00:00 AM | CO Issued | 8582 | Final | 101.0 | 10/15/21 11:03:48 AM | CO-000008582 | 980767.0 | 195231.0 | 40.702540 | -74.012562 | 101.0 | 1.0 | 1.0 | 1.0 | 9.0 | 1000005.0 | 1.000048e+09 | MN25 | Battery Park City-Lower Manhattan |
| 7 | 01 | ALTERATION TYPE 1 | 1000037 | MANHATTAN | 74 | BROAD STREET | 11.0 | 17.0 | 10004.0 | 08/20/2021 12:00:00 AM | CO Issued | 7521 | Renewal Without Change | 101.0 | 09/29/21 8:48:28 AM | CO-000007521 | 981042.0 | 196000.0 | 40.704651 | -74.011570 | 101.0 | 1.0 | 1.0 | 1.0 | 9.0 | 1000037.0 | 1.000110e+09 | MN25 | Battery Park City-Lower Manhattan |
| 8 | 01 | ALTERATION TYPE 1 | 1000037 | MANHATTAN | 74 | BROAD STREET | 11.0 | 17.0 | 10004.0 | 09/30/2021 12:00:00 AM | CO Issued | 9343 | Final | 101.0 | 01/25/22 4:10:34 PM | CO-000009343 | 981042.0 | 196000.0 | 40.704651 | -74.011570 | 101.0 | 1.0 | 1.0 | 1.0 | 9.0 | 1000037.0 | 1.000110e+09 | MN25 | Battery Park City-Lower Manhattan |
| 9 | 01 | ALTERATION TYPE 1 | 1000045 | MANHATTAN | 25 | BROADWAY | 13.0 | 27.0 | 10004.0 | 03/01/2022 12:00:00 AM | CO Issued | 15797 | Renewal Without Change | 101.0 | 03/01/22 2:57:15 PM | CO-000015797 | 980542.0 | 196401.0 | 40.705752 | -74.013374 | 101.0 | 1.0 | 1.0 | 1.0 | 13.0 | 1000045.0 | 1.000130e+09 | MN25 | Battery Park City-Lower Manhattan |
Last rows
| | JOB FILING NAME | JOB TYPE | BIN | BOROUGH | HOUSE NO | STREET NAME | BLOCK | LOT | ZIP CODE | SUBMITTED DATE | C OF O STATUS | C OF O SEQUENCE # | C OF O FILING TYPE | COMMUNITY BOARD | C OF O ISSUANCE DATE | APPLICATION NUMBER | xCoordinate | yCoordinate | latitude | longitude | communityDistrict | communityDistrictBoroughCode | communityDistrictNumber | cityCouncilDistrict | censusTract2010 | buildingIdentificationNumber | bbl | nta | ntaName |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 14256 | I1 | New Building | 5175076 | STATEN ISLAND | 173GAR | SOMMER AVENUE | 2223.0 | 13.0 | 10314.0 | 11/01/2021 12:00:00 AM | CO Issued | 10771 | Final | 502.0 | 11/12/21 12:34:07 PM | CO-000010771 | 937368.0 | 160894.0 | 40.608169 | -74.168845 | 502.0 | 5.0 | 2.0 | 50.0 | 29103.0 | NaN | NaN | SI05 | New Springville-Bloomfield-Travis |
| 14257 | I1 | New Building | 5175080 | STATEN ISLAND | 38 | ATLANTIC AVENUE | 3293.0 | 28.0 | 10304.0 | 04/11/2022 12:00:00 AM | CO Issued | 18093 | Final | 502.0 | 04/25/22 2:48:45 PM | CO-000018093 | 957318.0 | 155843.0 | 40.594389 | -74.096975 | 502.0 | 5.0 | 2.0 | 50.0 | 9602.0 | NaN | NaN | SI36 | Old Town-Dongan Hills-South Beach |
| 14258 | I1 | New Building | 5175081 | STATEN ISLAND | 38GAR | ATLANTIC AVENUE | 3293.0 | 28.0 | 10304.0 | 04/11/2022 12:00:00 AM | CO Issued | 18094 | Final | 502.0 | 04/25/22 1:24:20 PM | CO-000018094 | 957318.0 | 155843.0 | 40.594389 | -74.096975 | 502.0 | 5.0 | 2.0 | 50.0 | 9602.0 | NaN | NaN | SI36 | Old Town-Dongan Hills-South Beach |
| 14259 | I1 | New Building | 5175083 | STATEN ISLAND | 42GAR | ATLANTIC AVENUE | 3293.0 | 30.0 | 10304.0 | 04/12/2022 12:00:00 AM | CO Issued | 18097 | Final | 502.0 | 04/25/22 2:49:46 PM | CO-000018097 | 957340.0 | 155827.0 | 40.594345 | -74.096896 | 502.0 | 5.0 | 2.0 | 50.0 | 9602.0 | NaN | NaN | SI36 | Old Town-Dongan Hills-South Beach |
| 14260 | I1 | New Building | 5863165 | STATEN ISLAND | 1 | EVENTS PLAZA | 9999.0 | 1.0 | 10301.0 | 02/02/2022 12:00:00 AM | CO Issued | 14949 | Renewal Without Change | 501.0 | 02/02/22 9:42:56 PM | CO-000014949 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 14261 | I1 | New Building | 5863165 | STATEN ISLAND | 1 | EVENTS PLAZA | 9999.0 | 1.0 | 10301.0 | 02/11/2022 12:00:00 AM | CO Issued | 15415 | Renewal Without Change | 501.0 | 02/11/22 7:02:48 PM | CO-000015415 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 14262 | I1 | New Building | 5863165 | STATEN ISLAND | 1 | EVENTS PLAZA | 9999.0 | 1.0 | 10301.0 | 02/25/2022 12:00:00 AM | CO Issued | 16063 | Renewal With Change | 501.0 | 02/25/22 9:21:51 PM | CO-000016063 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 14263 | I1 | New Building | 5863165 | STATEN ISLAND | 1 | EVENTS PLAZA | 9999.0 | 1.0 | 10301.0 | 08/06/2021 12:00:00 AM | CO Issued | 6833 | Renewal With Change | 501.0 | 08/06/21 9:40:02 PM | CO-000006833 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 14264 | I1 | New Building | 5863165 | STATEN ISLAND | 1 | EVENTS PLAZA | 9999.0 | 1.0 | 10301.0 | 12/10/2021 12:00:00 AM | CO Issued | 12547 | Renewal With Change | 501.0 | 12/10/21 9:05:21 PM | CO-000012547 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 14265 | I1 | New Building | 5863165 | STATEN ISLAND | 1 | EVENTS PLAZA | 9999.0 | 1.0 | 10301.0 | 12/14/2021 12:00:00 AM | CO Issued | 12617 | Renewal With Change | 501.0 | 12/14/21 5:42:06 PM | CO-000012617 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |